45 research outputs found

Structure- and context-based analysis of the GxGYxYP family reveals a new putative class of glycoside hydrolase.

Author: Chang Yuanyuan
Eberhardt Ruth Y
Gilbert Harry J
Godzik Adam
Rigden Daniel J
Xu Qingping
Publication venue: eScholarship, University of California
Publication date: 01/06/2014

Background: Gut microbiome metagenomics has revealed many protein families and domains found largely or exclusively in that environment. Proteins containing the GxGYxYP domain are over-represented in the gut microbiota, and are found in Polysaccharide Utilization Loci in the gut symbiont Bacteroides thetaiotaomicron, suggesting their involvement in polysaccharide metabolism, but little else is known of the function of this domain. Results: Genomic context and domain architecture analyses support a role for the GxGYxYP domain in carbohydrate metabolism. Sparse occurrences in eukaryotes are the result of lateral gene transfer. The structure of the GxGYxYP domain-containing protein encoded by the BT2193 locus reveals two structural domains, the first composed of three divergent repeats with no recognisable homology to previously solved structures, the second a more familiar seven-stranded β/α barrel. Structure-based analyses including conservation mapping localise a presumed functional site to a cleft between the two domains of BT2193. Matching to a catalytic site template from a GH9 cellulase and other analyses point to a putative catalytic triad composed of Glu272, Asp331 and Asp333. Conclusions: We suggest that GxGYxYP-containing proteins constitute a novel glycoside hydrolase family of as yet unknown specificity.

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Structural genomics analysis of uncharacterized protein families overrepresented in human gut bacteria identifies a novel glycoside hydrolase.

Author: Axelrod Herbert L
Chang Yuanyuan
Eberhardt Ruth Y
Godzik Adam
Li Zhanwen
Rigden Daniel J
Sheydina Anna
Zmasek Christian C
Publication venue: eScholarship, University of California
Publication date: 01/01/2014

Background: Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results: BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases. The C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases; however, these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions: Structural and sequence analyses of the BT_1012 protein identify it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively.

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Two Pfam protein families characterized by a crystal structure of protein lpg2210 from Legionella pneumophila.

Author: Aravind L
Axelrod Herbert L
Bateman Alex
Chang Yuanyuan
Coggill Penelope
Das Debanu
Eberhardt Ruth Y
Finn Robert D
Godzik Adam
Jaroszewski Lukasz
Murzin Alexey G
Xu Qingping
Publication venue: eScholarship, University of California
Publication date: 01/09/2013

Background: Every genome contains a large number of uncharacterized proteins that may encode entirely novel biological systems. Many of these uncharacterized proteins fall into related sequence families. By applying sequence and structural analysis we hope to provide insight into novel biology. Results: We analyze a previously uncharacterized Pfam protein family called DUF4424 [Pfam:PF14415]. The recently solved three-dimensional structure of the protein lpg2210 from Legionella pneumophila provides the first structural information pertaining to this family. This protein additionally includes the first representative structure of another Pfam family called the YARHG domain [Pfam:PF13308]. The Pfam family DUF4424 adopts a 19-stranded beta-sandwich fold that shows similarity to the N-terminal domain of leukotriene A-4 hydrolase. The YARHG domain forms an all-helical domain at the C-terminus. Structure analysis allows us to recognize distant similarities between the DUF4424 domain and individual domains of M1 aminopeptidases and tricorn proteases, which form massive proteasome-like capsids in both archaea and bacteria. Conclusions: Based on our analyses we hypothesize that the DUF4424 domain may have a role in forming large, multi-component enzyme complexes. We suggest that the YARHG domain may play a role in binding a moiety in proximity with peptidoglycan, such as a hydrophobic outer membrane lipid or lipopolysaccharide.

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

LUD, a new protein domain associated with lactate utilization.

Author: Aravind L
Axelrod Herbert L
Bakolitsa Constantina
Bateman Alex
Coggill Penelope C
Eberhardt Ruth Y
Godzik Adam
Hwang William C
Pascual Jaime
Peterson Scott N
Punta Marco
Rawlings Neil D
Sedova Mayya
Publication venue: eScholarship, University of California
Publication date: 01/11/2013

Background: A novel highly conserved protein domain, DUF162 [Pfam: PF02589], can be mapped to two proteins: LutB and LutC. Both proteins are encoded by a highly conserved LutABC operon, which has been implicated in lactate utilization in bacteria. Based on our analysis of its sequence, structure, and recent experimental evidence reported by other groups, we hereby redefine DUF162 as the LUD domain family. Results: JCSG solved the first crystal structure [PDB:2G40] from the LUD domain family: LutC protein, encoded by ORF DR_1909, of Deinococcus radiodurans. LutC shares features with domains in the functionally diverse ISOCOT superfamily. We have observed that the LUD domain has an increased abundance in the human gut microbiome. Conclusions: We propose a model for the substrate and cofactor binding and regulation in the LUD domain. The significance of LUD-containing proteins in the human gut microbiome, and the implication of lactate metabolism in the radiation-resistance of Deinococcus radiodurans are discussed.

PubMed Central

eScholarship - University of California

New mini-zincin structures provide a minimal scaffold for members of this metallopeptidase superfamily

Author: Christine B Trame
Herbert L Axelrod
Marco Punta
Neil D Rawlings
Penelope Coggill
Ruth Y Eberhardt
Yuanyuan Chang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Crossref

Springer - Publisher Connector

Recommended from our members

Evaluating variants classified as pathogenic in ClinVar in the DDD Study.

Author: Constantinou Panayiotis
DDD Study
Eberhardt Ruth Y
Firth Helen V
FitzPatrick David R
Hurles Matthew E
Wright Caroline F
Publication venue: Genet Med
Publication date: 05/03/2021

PURPOSE: Automated variant filtering is an essential part of diagnostic genome-wide sequencing but may generate false negative results. We sought to investigate whether some previously identified pathogenic variants may be being routinely excluded by standard variant filtering pipelines. METHODS: We evaluated variants that were previously classified as pathogenic or likely pathogenic in ClinVar in known developmental disorder genes using exome sequence data from the Deciphering Developmental Disorders (DDD) study. RESULTS: Of these ClinVar pathogenic variants, 3.6% were identified among 13,462 DDD probands, and 1134/1352 (83.9%) had already been independently communicated to clinicians using DDD variant filtering pipelines as plausibly pathogenic. The remaining 218 variants failed consequence, inheritance, or other automated variant filters. Following clinical review of these additional variants, we were able to identify 112 variants in 107 (0.8%) DDD probands as potential diagnoses. CONCLUSION: Lower minor allele frequency (1 star) are good predictors of a previously identified variant being plausibly diagnostic for developmental disorders. However, around half of previously identified pathogenic variants excluded by automated variant filtering did not appear to be disease-causing, underlining the continued need for clinical evaluation of candidate variants as part of the diagnostic process

Apollo (Cambridge)

Detecting cryptic clinically relevant structural variation in exome-sequencing data increases diagnostic yield for developmental disorders.

Author: Danecek Petr
Eberhardt Ruth Y
Firth Helen V
FitzPatrick David R
Gallone Giuseppe
Gardner Eugene J
Hurles Matthew E
Lindsay Sarah J
Martin Hilary C
Prigmore Elena
Rajan Diana
Sifrim Alejandro
Wright Caroline F
Publication venue: Am J Hum Genet
Publication date: 10/10/2021

Structural variation (SV) describes a broad class of genetic variation greater than 50 bp in size. SVs can cause a wide range of genetic diseases and are prevalent in rare developmental disorders (DDs). Individuals presenting with DDs are often referred for diagnostic testing with chromosomal microarrays (CMAs) to identify large copy-number variants (CNVs) and/or with single-gene, gene-panel, or exome sequencing (ES) to identify single-nucleotide variants, small insertions/deletions, and CNVs. However, individuals with pathogenic SVs undetectable by conventional analysis often remain undiagnosed. Consequently, we have developed the tool InDelible, which interrogates short-read sequencing data for split-read clusters characteristic of SV breakpoints. We applied InDelible to 13,438 probands with severe DDs recruited as part of the Deciphering Developmental Disorders (DDD) study and discovered 63 rare, damaging variants in genes previously associated with DDs missed by standard SNV, indel, or CNV discovery approaches. Clinical review of these 63 variants determined that about half (30/63) were plausibly pathogenic. InDelible was particularly effective at ascertaining variants between 21 and 500 bp in size and increased the total number of potentially pathogenic variants identified by DDD in this size range by 42.9%. Of particular interest were seven confirmed de novo variants in MECP2, which represent 35.0% of all de novo protein-truncating variants in MECP2 among DDD study participants. InDelible provides a framework for the discovery of pathogenic SVs that are most likely missed by standard analytical workflows and has the potential to improve the diagnostic yield of ES across a broad range of genetic diseases

RD&E Research Repository

Apollo (Cambridge)

The contribution of X-linked coding variation to severe developmental disorders

Author: Akawi Nadia
Eberhardt Ruth Y.
Firth Helen V.
Fitzpatrick David R.
Gallone Giuseppe
Gardner Eugene J.
Hurles Matthew E.
Kaplanis Joanna
Martin Hilary C.
McRae Jeremy
Neville Matthew D. C.
Niemi Mari E. K.
Samocha Kaitlin E.
Sifrim Alejandro
Tavares Ana Lisa Taylor
Wright Caroline F.
Publication date: 27/01/2021

Over 130 X-linked genes have been robustly associated with developmental disorders, and X-linked causes have been hypothesised to underlie the higher developmental disorder rates in males. Here, we evaluate the burden of X-linked coding variation in 11,044 developmental disorder patients, and find a similar rate of X-linked causes in males and females (6.0% and 6.9%, respectively), indicating that such variants do not account for the 1.4-fold male bias. We develop an improved strategy to detect X-linked developmental disorders and identify 23 significant genes, all of which were previously known, consistent with our inference that the vast majority of the X-linked burden is in known developmental disorder-associated genes. Importantly, we estimate that, in male probands, only 13% of inherited rare missense variants in known developmental disorder-associated genes are likely to be pathogenic. Our results demonstrate that statistical analysis of large datasets can refine our understanding of modes of inheritance for individual X-linked disorders. Developmental disorders (DDs) are more prevalent in males, thought to be due to X-linked genetic variation. Here, the authors investigate the burden of X-linked coding variants in 11,044 DD patients, showing that this contributes to approximately 6% of both male and female cases and therefore does not solely explain male bias in DDs.

Enlighten

The University of Manchester - Institutional Repository

Helsingin yliopiston digitaalinen arkisto

University of Dundee Online Publications

Apollo (Cambridge)

White Rose Research Online